Results for words containing a comma (,):

Rank in Wordlist | Frequency | Word |
---|---|---|
7607 | 3 | 2,5 |
9913 | 2 | 0,5 |
14863 | 1 | 1,25إلى |
14934 | 1 | 14,2 |
15182 | 1 | 4,5 |
15268 | 1 | 7,8 |

Results for words containing %:

Rank in Wordlist | Frequency | Word |
---|---|---|
4528 | 6 | 20% |
6204 | 4 | 60% |
7623 | 3 | 80% |
9919 | 2 | 10% |
9920 | 2 | 100% |
9930 | 2 | 18% |
9962 | 2 | 35% |
9968 | 2 | 50% |
9973 | 2 | 7% |
9978 | 2 | 87% |

Results for words containing an apostrophe ('):

Rank in Wordlist | Frequency | Word |
---|---|---|
22135 | 1 | بدوره'أن |

Results for words containing /:

Rank in Wordlist | Frequency | Word |
---|---|---|
9960 | 2 | 3/1 |
14842 | 1 | 004/2010 |
14843 | 1 | 005/2011 |
14849 | 1 | 0163/2011 |
14850 | 1 | 017/2006 |
14855 | 1 | 04/92 |
14866 | 1 | 1/0 |
14893 | 1 | 1123/2010 |
14940 | 1 | 1431/2010 |
15034 | 1 | 2/ |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words while others are not. If words containing forbidden characters occur with more than very low frequency, this may point to a problem in preprocessing.
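The frequency criterion can be sketched as a small filter. This is only an illustration: the set of forbidden characters and the frequency cut-off are assumptions that must be chosen per language and corpus, and the function name `suspicious` is hypothetical. The sample rows are taken from the result tables in this section.

```python
# Assumed policy (hypothetical): characters not allowed inside words,
# and the frequency above which such a word is worth a closer look.
FORBIDDEN = {"'", "/"}
MAX_FREQ = 1

def suspicious(rows, forbidden=FORBIDDEN, max_freq=MAX_FREQ):
    """Return the (rank, freq, word) rows whose word contains a forbidden
    character and whose frequency exceeds max_freq."""
    return [
        (rank, freq, word)
        for rank, freq, word in rows
        if freq > max_freq and any(ch in forbidden for ch in word)
    ]

# Rows in the (rank, frequency, word) layout of the tables in this section.
rows = [
    (4528, 6, "20%"),
    (9960, 2, "3/1"),
    (22135, 1, "بدوره'أن"),
]

for row in suspicious(rows):
    print(row)
```

With these assumed settings, a word such as 20% is ignored because % is not in the forbidden set, and a frequency-1 word is treated as noise rather than as a preprocessing problem.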
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
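The same query can be repeated for every character in the list. The following is a minimal sketch in Python with SQLite, not the tooling used to produce this report. The table layout `words(w_id, word, freq)` is inferred from the query above, where `w_id-100` appears to correspond to the "Rank in Wordlist" column; the helper names, the in-memory database, and the assumption that the first 100 IDs hold non-word tokens are illustrative only. The original query seems to rely on the backslash being the default LIKE escape character (as in MySQL), whereas SQLite needs an explicit ESCAPE clause.

```python
import sqlite3

# Characters listed in the text above.
SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]

def like_pattern(ch):
    """Build a LIKE pattern that matches words containing ch literally."""
    # % and _ are LIKE wildcards; the backslash is our escape character.
    if ch in ("%", "_", "\\"):
        ch = "\\" + ch
    return "%" + ch + "%"

def words_containing(conn, ch, limit=10):
    """Return (rank, freq, word) rows for words that contain ch."""
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (like_pattern(ch), limit)).fetchall()

def build_demo_db():
    """Create a tiny in-memory stand-in for the corpus database (hypothetical data layout)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)")
    conn.executemany(
        "INSERT INTO words VALUES (?, ?, ?)",
        [
            (50, "%", 1000),     # assumed non-word token with w_id <= 100
            (4628, "20%", 6),
            (7707, "2,5", 3),
            (10060, "3/1", 2),
            (10200, "word", 2),  # no special character
        ],
    )
    return conn

conn = build_demo_db()
for ch in SPECIAL_CHARS:
    print(ch, words_containing(conn, ch))
```

Passing the pattern as a bound parameter avoids quoting problems for characters such as " and ', which would otherwise have to be escaped inside the SQL string itself.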
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots